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Abstract 

The aim of this worl< was to map and identify the Phytophthora sojae AvrTc gene. Progeny from a cross of P. sojae strains 
ACR10xP7076 were tested for virulence on plants carrying Rpslc. Results indicate that avirulence segregates as a dominant 
trait. We mapped the Avrlc locus by performing whole genome re-sequencing of composite libraries created from pooled 
samples. Sequence reads from avirulent (Pooll) and virulent (Pool2) samples were aligned to the reference genome and 
single nucleotide polymorphisms (SNP) were identified for each pool. High quality SNPs were filtered to select for positions 
where SNP frequency was close to expected values for each pool. Only three SNP positions fit all requirements, and these 
occurred in close proximity. Additional DNA markers were developed and scored in the F2 progeny, producing a fine 
genetic map that places Avrlc within the Avrla gene cluster. Transient expression of Avrlc or Avrla triggers cell death on 
Rpslc plants, but Avrlc does not trigger cell death on Rpsla plants. Sequence comparisons show that the RXLR effector 
genes Avrlc and Avrla are closely related paralogs. Gain of virulence on Rpslc in P. sojae strain P7076 is achieved by gene 
deletion, but in most other strains this is accomplished by gene silencing. This work provides practical tools for crop 
breeding and diagnostics, as the Rpslc gene is widely deployed in commercial soybean cultivars. 
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introduction 

Plants are challenged by a diversity of pathogens and rely on 
pre-existing and elaborated defense mechanisms including innate 
immunity. Pathogens have developed effector arsenals to success- 
fully colonize plants and overcome plant defenses. Host plants 
detect microbe-associated molecular patterns (MAMPs) and 
pathogen effector molecules by use of immune receptors 
positioned on the outer plasma membrane or inside the cell 
cytoplasm [1]. The activation of pattern-triggered immunity (PTI) 
or effector-triggered immunity (ETI) leads to defense activation 
and pathogen containment. In many cases host immune receptors 
and pathogen effector proteins underlie resistance [Rj and 
avirulence [Am) gene interactions described by plant pathologists. 

Soybean resistance to the root rot pathogen Phytophthora sojae is 
determined in part by the presence of Rps genes. P. sojae effectors 
that are recognized by Rps proteins and cause ETI are Avr factors. 
As key elements that determine disease outcome on soybean, the 
Avr factors of P. sojae have been a focus of study and targeted for 
identification [2] . Several P. sojae Avr genes have been identified to 
date including Avrla [3], Avrlb [4], Avrld [5], Avrlk [6], Avr3a/ 5 
[3,7], AvrSb [8], Avr3c [9], and Avr4/6 [10]. Each Avr gene is 



predicted to encode a small secreted protein containing a signal 
peptide followed by an RXLR (Arg-any amino acid-Leu-Arg) 
sequence. This conserved motif is proposed to have a role in the 
delivery of the effector protein to the host cell cytoplasm [11,12]. 
Additional conserved features in the C-terminal or effector 
domains of the proteins are suggested to be important functional 
elements [13,14]. 

The identification of P. sojae Avr genes has relied on map-based 
approaches. Segregating F2 populations arising from outcrosses of 
parental strains that differ in virulence on particular Rps genes 
have provided the basis for investigations into Avr determinants. 
This core strategy has been assisted by strain-specific associative 
mapping, whole genome sequence information, transcript profil- 
ing, and the annotation of some 400 predicted P. sojae RXLR 
effectors known as Avh genes [15-17]. Selecting candidate Avh 
genes a priori can accelerate mapping and identification but this 
tactic is not fail-safe. Bulked segregant analysis (BSA) or pooled 
sampling is a powerful method that aids de novo mapping by 
providing genetic markers linked to gene targets [18,19]. Recently, 
pooling has been combined with deep sequencing technologies to 
rapidly map and identify target regions in a variety of organisms 
[20-22]. 
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The objective of the present study was to identify the P. sojae 
Avrlc locus. A genetic cross of P. sojae strains AGR10xP7076, 
which differ in virulence on Rpslc, was employed to track the 
segregation o{ Avrlc in F] and F2 progeny. Candidate Avh genes 
were selected for mapping but this was not successful, as none of 
the candidates co-segregated with Avrlc. We resorted to mapping 
by performing BSA and single nucleotide polymorphism (SNP) 
analysis of whole genome re-sequencing data. This led to the 
finding that Avrlc is part of the Avrla gene cluster. The Avrlc gene 
encodes a predicted RXLR effector protein of 126 amino acids. 
The Avrla and Amlc genes are close paralogs that are nearly 
identical in sequence at their N-terminal regions but that have 
diverged substantially in their C-terminal effector domains. 

Results 

Avirulence towards Rpslc is a dominant trait in P. sojae 
cross ACR10xP7076 

To determine how virulence towards Rpslc is inherited in P. 
sojae, we analyzed progeny from outcrosses between strains. The P. 
sojae strains ACRIO and P7076 differ in virulence towards soybean 
plants carrying the Rpslc gene; ACRIO is avirulent while P7076 is 
virulent. Both strains are virulent towards control plants that lack 
Rpslc, such as cv Williams (no known Rps genes) or Harosoy 
[Rpsl). Outcrosses of ACRIO x P7076 were performed and the 
resulting Fi and F2 progeny were tested for virulence towards 
Rpslc. From a total of 25 F] progeny, an avirulent:virulent 
phenotypic ratio of 23:2 was observed. Testing of 40 F2 progeny 
derived from a single Fi individual resulted in consistent readouts 
for 28 of the progeny, whereas 12 Fg progeny could not be reliably 
scored for virulence on Rpslc, or had lost virulence toward control 
plants lacking the Rpslc gene. For the 28 F2 progeny with reliable 
phenotypes and that retained virulence towards the control plants, 
the avirulentivirulent ratio towards Rpslc was 22:6 (Chi-squared 
probability = 0.66 for 3:1). Overall, the results suggest that 
avirulence towards Rpslc segregates as a dominant trait in outcross 
between P. sojae strains ACRIO xP7076. 

Selected candidate Avh genes do not co-segregate 
with Avrlc 

To accelerate the identification of Avr genes, pre-selecting 
candidate genes has proven a successful tactic in many cases 
[5,8,9,23-25]. Briefly, candidates are chosen based upon sequence 
characteristics, expression and polymorphism data and tested for 
co-segregation or association with the virulence trait, or direcdy 
screened for functional interaction with selected _ff-genes. We 
selected a total of 24 Avh genes as candidates for Avrlc, based upon 
comparison oi Avh gene sequences in the reference P. sojae strain 
P6497 and re-sequenced strains P7064, P7074 and P7076, as 
shown in Supplementary Table SI. Strains P6497, P7064, and 
P7074 are avirulent, while strain P7076 is virulent on Rpslc. 
Cleaved amplified polymorphic (CAP) markers were designed to 
score each of the 24 candidate Arh genes in the ACRIO xP7076 F2 
progeny. Unfortunately, none of the 24 candidate Avh genes tested 
co-segregated with Avrlc, so this approach was not successful. 

Deep sequencing of progeny pools identifies SNPs linked 
to Avrlc 

The de novo mapping of Avrlc was accomplished using next- 
generation sequencing and BSA applied to two pools of F2 
progeny [18,20,21]. Equal amounts of DNA from six avirulent 
(Pooll) or six virulent (Pool2) F2 individuals were combined to 
construct two composite genomic libraries for deep sequencing 



and SNP analysis. In excess of 150 million sequence reads were 
obtained from each of the pools, as detailed in Table 1 and Figure 1 . 
Reads were aligned to the reference genome P6497 and SNPs called 
from the resulting alignment map. Mapping SNPs linked to Avrlc 
proceeded under the following assumptions. Based on the dominant 
phenot^pe obser\'ed for Avrlc, the avirulent pool (Pooll) was 
expected to contain both alleles ot Avrlc and therefore consist of a 
mix of homozygous dominant and heterozygous genotypes in a ratio 
of 1 :2. The expected non-reference base allele frequency for a causal 
SNP in this pool should then be the average of one homozygous 
dominant allele (0) and two heterozygous alleles (0.5) for an 
expected frequency of 0.33. However, the virulent pool (Pool2) 
should consist of exclusively homozygous recessive genotypes with 
only the virulent allele of Avrlc represented and thus possess a non- 
reference base allele frequency of 1.0 for the causal SNP. These 
numbers may vary slightly due to effects in sampling, sequencing 
errors or mapping accuracy. However, both the causal SNP for 
virulence as well as nearby SNPs genetically linked to the loci should 
exhibit allele frequencies very close to these expectations. 
Conversely, for SNPs linked to the avirulent allele, the reciprocal 
values of 0.67 (Pooll) and 0.0 (Pool2) would be expected. Therefore, 
our causal SNP and those linked to it should be identifiable by 
selecting for SNPs that exhibit the expected allele frequency in each 
respective pool. 

With this is mind, a sequence alignment map (SAM) was 
generated by aligning reads from each sequenced pool to the 
P6497 reference. The SNPs were then called using Samtools and 
filtered for quality (Phred quality > 30). A total of I.I07 (Pooll) 
and 434 (Pool2) high quality SNPs were obtainc-d and filtered 
using a window of acceptable allele frequencies based on the 
expected values for each pool. A window of acceptable non- 
reference allele frequency values ranging from 0.18 to 0.48 was 
applied to Pooll SNPs, and a window of 0.9 to 1.0 was used to 
select for Pool2 SNPs. The reciprocal values were also used in 
order to capture SNPs linked to either allele. By this criteria, 547 
(Pooll) and 204 (Pool2) SNP positions were found, but only three 
SNPs exhibited allele frequencies that fit both pool expectations, as 
shown in Figure lA. Remarkably, as would be expected from 
genetic linkage, all three of the identified SNPs occur in close 
physical proximity within a 92 kb segment on Scaffbld_7 in the 
P. sojae reference genome assembly (v5.0) (Figure IB). 

Avrlc is located in the Avrla gene cluster 

The SNPs identified in our mapping occurred in close proximity 
in a region rich in Avh genes, including the previously identified 
Avrla locus, as shown in Figure 2A. In order to verify that Avrlc 
occurs in this genome region and to more accurately map its 
position, a set of DNA markers was developed based upon 
polymorphic sites identified by comparison of the P. sojae reference 
genome and sequence reads from each of the pools. Useful 
markers were also available from previous mapping work on Avrla 
[3,26]. Each marker was scored in a mapping population of 28 F2 
progeny. The results indicate thut Avrlc maps precisely to the Avrla 
locus (Figure 2B). 

The predicted RXLR effector gene Avh275c corresponds 

to Avrlc 

Previous work on Avrla indi[:ated that this locus is highly 
polymorphic and displays copy number variation among P. sojae 
strains [3]. The reference strain P6497 contains a tandem array of 
four related 5.2 kb segments, as shown in Figure 2C. Two of these 
segments include copies ot Avrla while adjacent segments include 
paralogous genes named Avh275c and Avh72. The Avh72 open 
reading frame includes a frame-shift and no transcripts can be 
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Figure 1. Identification of SNPs linl<ed to Avric by bulked 
segregant analysis and deep sequencing. A, The procedure for 
discovery of candidate SNPs linked to Avric is shown. Selected F2 
progeny from a cross of P. sojae strains ACR10xP7076 were pooled 
according to their virulence phenotype. The avirulent (A) Pooll and 
virulent (V) Pool2 composite DNA samples were deeply sequenced. 
Sequence reads were aligned against the reference genome, and SNPs 
were identified and filtered based upon quality scores. High quality 
SNPs were further filtered according to the predicted SNP frequencies 
for Pooll and Pool2. After processing, only three SNPs passed all 
requirements. B, Genome location of three candidate SNPs. These three 
candidate SNPs occur in close physical proximity in the reference 
genome assembly. All three sites fall within a 92 kb segment on 
Scaffold_7 (V5.0). Reference (Ref) allele and alternate (Alt) alleles for 
three SNPs are shown. The SNP frequencies (Freq) in each pool are also 
shown. 

doi:1 0.1 371 /journal.pone.0089738.g001 

detected for this gene, indicating that Avh72 is a pseudogene. 
However, the Avh275c open reading frame is intact and transcripts 
can be detected for this gene in several P. sojae strains. The Avh275c 
gene encodes a predicted RXLR effector protein of 126 amino 
acids. Results described below indicate that Avh275c corresponds 



to Avric, thus for simplicity, we will hereafter refer to the Avh275c 
gene as Avric. 

To investigate the structure of the Avric locus among P. sojae 
strains, genomic DNA was digested with the restriction enzyme 
PmK, blotted, and hybridized with a probe that detects Avh72, 
Auric and Avrla. The PmK digestion is known to resolve each of the 
four copies in the tandem array, including Avh72, Avric, and the 
two copies oi Avrla, as we have previously demonstrated [3]. The 
results shown in Figure 3 indicate that P. sojae strain P7076 lacks a 
copy of the Avric gene, but that all other strains tested possess a 
copy of Avric. Additionally, data from re-sequencing of the 
parental strains ACRIO and P7076 was used to measure read 
coverage across the Avric segment, as constructed for the reference 
strain P6497 [3]. As is shown in Figure 4, there is no coverage in 
the Avric region in virulent strain P7076, while this region is 
covered by sequence reads in the avirulent strain ACRIO. Thus, 
the DNA blot and the sequence coverage analyses indicate that 
Avric is deleted from P. .sojae strain P7076 but that copies oiAvh72 
and Avrla remain in this tandem gene cluster. 

Sequence analysis revealed three different Avric alleles within 
our P. sojae strain collection (Figure SI). These include the Avrlc-1 
allele (represented by strain P6497; also present in strain ACRIO), 
the Avrlc-2 allele (represented by strain P7064) and the Avrlc-S 
allele (strain P7074). The Avrla DNA sequence encoding the open 
reading frame for the protein is not polymorphic among known 
strains of P. sojae that carry copies of the gene. Comparison of the 
predicted amino acid sequences of Avr 1 a and each of the Avr 1 c 
alleles is shown in Figure 5. The N- terminal portions of the Avrla 
and Avric predicted proteins are nearly identical, while the C- 
terminal regions have diverged substantially. Among the Avric 
alleles, Avrlc^'"^* displays four amino acid changes (S113G, 
K114R, I115L,V120G) and Avrlc^™'"^ displays five changes 
(L69Q, S82P, SI 13G, Kl 14R, II 15L), compared to the reference 
allele Avrlc^*'*^'^. 

To test for the presence of Avric mRNA transcripts among 
P. .sojae strains, we performed reverse transcriptase polymerase 
chain reaction (RT-PCR). The Avrla transcript was also measured 
for comparison since these two genes occur adjacent to each other. 
Results show that transcripts of Avric and Avrla are present in 
strains that are avirulent towards Rpslc and Rpsla, respectively, 
but not in the corresponding virulent strains. One exception is 
strain ACR9, which expresses the Avric transcript but nonetheless 
is virulent on Rpslc (Figure 6). 

Transient expression of /Avr /c triggers cell death in Rpslc 
plants 

Transient expression of an Avr effector protein in a soybean 
plant containing a matching Rps gene triggers cell death. In order 



Table 1. Whole genome re-sequencing of connposite and strain specific P. sojae libraries. 





Sample 


Total reads^ (r 


nlllions) 


Q30^ (%) 


Mapped reads^ (%) 


Average Coverage* 


Pooll 


153.3 




84.3 


95.9 


180 


P00I2 


205.9 




84.1 


95.8 


241 


ACRIO 


164.0 




89.0 


97.6 


196 


P7076 


191.7 




87.2 


98.3 


230 



'lllumina HiSeq 2000, read length 100 bp. 
^Sequence reads with quality score >30. 

^Sequence reads that map to the 82 Mb P. sojae reference genome assembly. 
''Average depth of sequence read coverage. 
doi:l 0.1 371 /journal.pone.0089738.t001 
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Figure 2. Genetic and physical mapping of Avrlc region. A, Physical map of Avrlc region. The position of the three Identified SNP markers, 
AvrTa, and 25 predicted Avh genes are shown. B, Genetic analysis of the Avrlc region. The position of DNA markers and their recombination 
frequency (RF%) with Avrlc In a segregating F2 population (n = 28) Is Indicated; Mbp, mega base pair. C, Predicted arrangement of the Avrla locus In 
P. sojae reference strain P6497. 
dol:1 0.1 371 /journal.pone.0089738.g002 
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Figure 3. Genomic DNA blot hybridization showing P. sojae strain-specific deletions of Avrla and Avrlc. Selected strains of P. sojae 
genomic DNA were digested with Pm/l and separated by electrophoresis prior to blotting and hybridization. Virulence phenotype of the P. sojae 
strains on Rpslc or Rpsla plants Is shown as virulent (V) or avirulent (A). The positions and sizes of the Pm/l segments of /Aw/a, AvrTc, and Avh72 
genes are Indicated on the right. The sizes of DNA markers are shown on the left; kb, kilo base pair. The Avrlc gene Is deleted from parental strain 
P7076. 

dol:1 0.1 371 /journal.pone.0089738.g003 



PLOS ONE I www.plosone.org 



4 



February 2014 | Volume 9 | Issue 2 | e89738 



Phytophthora sojae RXLR Effector Avric 



16 



S 14 

01 
BO 

m 

o 12 



K 8 



ACRIO ■ 
P7076 ■ 



Avh72 




6 8 

Position (l<b) 



Figure 4. Sequence read coverage of the Avrla/Avric region from re-sequencing of P. sojae strains ACRIO and P7076. An assembly of 
the /Avr/a region from reference strain P6497 was used to align sequence reads from parental strains ACRIO and P7076. The DNA segment 
corresponding to the /Aw/c gene lacks sequence reads in parental strain P7076. The longest segment of identity between the ks/r'ia and hvr'ic DNA 
sequences is 98 nucleotides, which is shorter than the read length of 100 nucleotides. 
doi:1 0.1 371 /journal.pone.0089738.g004 



to test whether Avrlc can trigger cell death in an Rp.slc soybean 
plant, transient expression of Avrlc together with a reporter gene 
was performed in a co-bombardment assay [27]. Plasmid 
constructs Avrlc excluding the native signal peptide sequence, 
were bombarded into Rpslc soybean leaves along with glucuron- 
idase (GUS) reporter gene to measure cell death. This was done 
for each of the three Avrlc alleles in the soybean cv Williams (no 
known Rps genes) and the isoline L75-3735 [Rpslc). Expression of 
Avrlc^''^'^'' and Avrlc^^"'''' triggered cell death specifically in leaves 
of Rpslc plants, whereas Avrlc^''"''^ did not trigger cell death in 
Rpslc plants, as shown in Figure 7 and Figure 8. None of the 
alleles triggered cell death in control plants lacking the Rpslc gene. 



Results from this functional assay indicate that Avrlc 



and 



Avrlc'''"^*, but not Avrlc^™'^*, are recognized by Rpslc. We also 
tested the interactions between Avrla and Rpslc, and for each of 
the Avrlc alleles and Rpsla., since Avrlc and Avrla share 
considerable sequence identity. Results indicate that expression 
oi Avrla triggers cell death in Rpslc leaves, but that none of the 



Avrlc alleles trigger cell death in i?/),f la leaves, as determined by the 
co-bombardment assay (Figures 7, 8, 9). 

Discussion 

The soybean Rpslc gene is extensively deployed for the 
management of root and stem rot disease caused by P. sojae. 
Based on recent variety trial data shown in Table 2, Rpslc is the 
most prevalent source of resistance to P. sojae in the province of 
Ontario, where over 1 million hectares of land is annually 
dedicated to soybean production. The Rpslc gene is also widely 
deployed in the USA and other major soybean growing regions. 
Thus, identification of the Avrlc gene in P. sojae will have practical 
uses in soybean breeding and pathogen diagnostics. 

Genomic resources and annotation of RXLR effector reper- 
toires for oomycete plant pathogens have facilitated Avr gene 
identification in several species. Large scale effector screens have 
proven to be successful for identifying or predicting Avr 



Signal peptide RXLR 
I 1 I 1 

Avrlc_P6497 MRLTNTLVVAVAAILLASENAFSAATDADQATVSKFAAAEFDTLVDVLTTESKRSLRATV 60 

Avrlc_P7064 MRLTNTLWAVAAILLASENAFSAATDADQATVSKFAAAEFDTLVDVLTTESKRSLRATV 60 

Avrlc_P7074 MRLTNTLVVAVAAILLASENAFSAATDADQATVSKFAAAEFDTLVDVLTTESKRSLRATV 60 

Avrla_P6497 MRLTNTLVVAVAAILLASENAFSAATDADQATVSKLAAAEFDTLVDVLTTESKRSLRATV 60 
*********************************** . ************************ 

dEER WY domain 
I 1 I 1 

Avrlc_P6497 DDGEERYKLFKIEALQQGKWTSFFKKWSSNGRTESDIKKKLANVKWLSESDKSKIVDNYV 120 

AvrlcIP7064 DDGEERYKLFKIEALQQGKWTSFFKKWSSNGRTESDIKKKLANVKWLSESDKGRLVDNYG 120 
Avrlc_P7074 DDGEERYKQFKIEALQQGKWTPFFKKWSSNGRTESDIKKKLANVKWLSESDK GRLV DNYV 120 
Avrla_P6497 DDGEERYKQFKIEALKKGKWTDIFNKWKGNELSPAEVQNKLKNKK-LSDDLKDAIFRNYK 119 

******** ******..**** . + .** * ** * * **. * . ** 



Avrlc_P6497 
Avrlc_P7064 
Avrlc_P7074 
Avrla P6497 



FHLRGH 126 
FHLRGH 12 6 
FHLRGH 126 
DW 121 



Figure 5. Amino acid sequence alignment of the predicted proteins for Avr1 a and each of the three alleles of Avr1 c. The residues are 
colored according to their physicochemical properties. Signal peptide, RXLR and dEER motifs, and WY-domain are shown, and polymorphic residues 
among the three Avrlc alleles are underlined. An asterisk (*) indicates positions which have a single, fully conserved residue; a colon (:) indicates 
conservation between groups of strongly similar properties; a period (.) indicates conservation between groups of weakly similar properties. 
doi:1 0.1 371/journal.pone.0089738.g005 
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Figure 6. Analysis of alleles of Avrlc, and transcripts of Avrlc and Avrla in P. sojae strains. Shown at the top of the figure are the Avrlc 
alleles present In each of the selected P. sojae strains. Virulence phenotype of the P. soyae strains on Rpsi c or Rpsi a plants is shown as virulent (V) or 
avirulent (A). Reverse transcriptase polymerase chain reaction (RT-PCR) analysis was performed using mRNA from mycelia cultures, and Avrla and 
AvrTc specific primers, to test for transcripts of these two genes. Results from RT-PCR of the control gene Actin are also shown. 
doi:1 0.1 371 /journal.pone.0089738.g006 
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Figure 7. Transient expression of Avrlc and Avrla triggers cell death in Rps^c soybean plants. Measurement of cell death in soybean 
leaves by co-bombardment and transient expression of a GUS reporter together with a test gene. Test genes that cause cell death reduce GUS 
expression and blue staining. A, Leaves of Williams (rps) and the isoline L75-3735 (Rpslc) tested with each of the three Avrlc alleles, and Avrla. B, 
Leaves of Williams (rps) and the isoline L75-6141 (Rpsla) tested with each of the three Avrlc alleles. Control test gene in each experiment corresponds 
to a synthetic /\i/r;a sequence with a frame-shift mutation. Results show means and standard errors of three independent biological replicates, with a 
minimum of three leaves per treatment, per replicate. 
doi:1 0.1 371/journal.pone.0089738.g007 
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Figure 8. Photographs of leaves of Williams {rps) and the 
isoline L75-3735 (Rps1d\ tested with each of the three Avrlc 
alleles, and Avrla. Results from a representative co-bombardment 
experiment are shown, after staining for GUS expression. 
doi:1 0.1 371 /journal.pone.0089738.g008 



determinants in Phytophthora infestans [24,28] and Bremia lactucae 
[29], but this method is less usefixl in P. sojae because of the 
difficulty in performing Agrobacterium mediated infiltration and 
transient expression in soybean. However, being a homothaUic 
organism P. sojae offers certain advantages over predominately 
heterothallic species such as P. infestans and B. lactucae. For 
example, creating F2 populations for genetic mapping is generally 
possible for homothaUic but not heterothallic species. In fact, 
identification of all known P. sojae Am genes has relied on mapping 
in segregating F2 populations. Most recently, this strategy has been 
accelerated by pre-selection of candidate RXLR effectors [5,8,9]. 
The failure of the candidate gene approach in the present study 
was primarily a result of an incomplete RXLR inventory. To select 
candidates based on strain-specific polymorphisms, we rehed on 
previous work that described a comparative genome-wide analysis 
oi Avh genes among P. sojae strains [15]. Unfortunately, Avh275c 
was not included in the gene set because of poor annotation of 
tandem arrays in the reference genome. In addition to Avh275c, we 
also should have included known P. sojae Avr genes (or closely 
linked markers) in our initial screen, as this would have revealed 
co-segregation o{ Avrlc and Avrla. 

Of course, it is easier to select candidates in hindsight, and any 
approach of choosing genes a priori vn&. always carry a risk because 
it involves uncertainties. In contrast, de novo mapping of genes 
represents an unbiased and more robust method for target 
identification but can involve a greater research investment. Deep 
sequencing technologies combined with bioinformatics offer 
powerful tools for gene mapping. Previously, BSA has been 
combined with amplified fragment length polymorphism (AFLP) 
[26], and microarray analyses [3] for mapping genes in P. sojae. 
Our present scheme of combining BSA with whole genome re- 
sequencing is a natural step forward, especially since this technique 
has proven successful in other organisms [20,22]. With a high 
quality reference genome assembly, the path from finding linked 
markers to identifying target genes can proceed rapidly, as we have 
shown in this study. 
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Figure 9. Photographs of leaves of Williams (rp^ and the 
isoline L75-6141 [Rps^a) tested with each of the three Avrlc 
alleles. Results from a representative co-bombardment experiment are 
shown, after staining for GUS expression. 
doi:10.1371/journal.pone.0089738.g009 



Our conclusion that avirulence towards Rpslc is inherited as a 
dominant trait in crosses between P. sojae strains ACRIO xP7076 is 
consistent with the segregation results, despite the deviations that 
were observed. Previous inheritance studies have suggested that 
Avrlc is dominant [30] or recessive [31,32], but the strains used in 
past studies were different from those used in the present analysis. 
We show that the Avrlc gene is deleted from the i?/)jlc-virulent 
strain, P7076, which is consistent with avirulence being dominant 
in the cross ACRIO xP7076. Our finding that many P. sojae strains 
carry a copy oi Avrlc, yet do not express the gene, indicates that 
gain of virulence on Rpslc can also result from gene silencing. This 
could also explain past inheritance results that showed virulence 
towards Rpslc to be dominant in Fi hybrids [32], since it is known 
that gene silencing in P. sojae can cause transgenerational 
epigenetic inheritance that results in gain of virulence [33,34]. 
In fact, comparison of results of RT-PCR analyses suggests 
transcriptional switching of gene expression/silencing of Avrlc and 
Avrla in clonaUy propagated P. sojae cultures, because we noted 
some differences in expression between a previous study [3] and 
the present one, for strains ACR6, ACR9, ACR16, and ACR20. 
Variation in virulence of clonal P. .sojae and P. infestans cultures is 
well known [35,36]. The alternate loss and recovery of virulence in 



Table 2. Rps^ c is the most prevalent known source of P. sojae 
resistance In soybean lines entered in Ontario variety trialsV 





Rps gene(s) 


Number of lines 


Not known/undisclosed 


137 


Rpslc 


53 


Rps Ik 


36 


Rps3a 


6 


Rpsla 


5 


Rpslc -1- Rps3a 


3 


Rpslc -1- Rpslk 1 



'Source: 2012 Ontario Soybean Variety Trials. Data collected 2010-2012. 
Conducted by the Ontario Oil and Protein Seed Crop Committee 
www.gosoy.ca. 

doi:1 0.1 371 /journal.pone.0a89738.t002 
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successive clonal generations has puzzled investigators [35], but 
our results indicate that epiaUelic switching of effector gene 
expression is a mechanism that could account for this phenom- 
enon. 

Besides gene deletion and gene silencing, our results suggest that 
P. sojae can escape i?/wlc-mediated immunity by other means. The 
transient expression assay indicates that Avrlc^''*^'^, Avrlc^^'^''*, 
and Avrla are recognized by Rpslc plants but that Avrlc^'"'* is 
not. There are five amino acid polymorphisms between Avr 1 e''*''*^' 
and Avrlc^™'*, but three of these (S113G, K114R, I115L) are 
shared with Avrlc'""'', and one (L69Q) with Avrla. Thus, we 
hypothesize that the critical change to Avrlc^'"'* that enables it to 
escape Rpslc is the S82P mutation. Introduction of a proline 
residue represents a severe change with potential to alter the 
secondary structure in a region that is predicted to be alpha- 
hehcal, providing a rationak- for the importance of the S82P 
mutation. However, P. sojae strain P7074, which carries and 
expresses the Avrl<^^^^* allele, is avirulent towards Rpslc soybean 
plants. We propose that expression of Avrla by strain P7074 
triggers immunity on Rpslc, because our results show that Rpslc 
can detect Avrla in addition to Avrlc''^*'' and Avr 1 c'"'"*'*. We 
predict that a P. sojae strain expressing the Avrlc''^"^'' allele but 
lacking Avrla transcripts will be virulent towards Rpslc, but this 
remains to be demonstrated. 

Yet another mechanism to evade Rpslc immunity must be 
invoked to account for the finding that strain ACR9 is virulent on 
Rpslc plants. Results shown here indicate that ACR9 expresses 
transcripts corresponding to the Avrlc^^^"^ allele, which should 
trigger immunity on Rpslc based on the transient expression 
results, yet ACR9 is virulent on Rpslc plants. Each of these tests, 
mRNA analysis and phenotypic scoring, was replicated several 
times during the course of the present study with consistent results. 
Additionally, plants inoculated with strain ACR9 tested positive 
for Avrlc expression by RT-PCR (not shown). Given the results, 
how do we (explain the virulence of ACR9 on Rpslc plants? A 
possible explanation is that there are additional effectors in strain 
ACR9 that suppress ETI caused by the Avrlc-Rpslc interaction. 
It is known that particular RXLR effectors can suppress immune 
responses triggered by other RXLR effectors, and that potential 
Avr-R gene interactions can be masked by these apparent epistatic 
effects [15,37-41]. 

Epistatic interactions could also cause unusual inheritance 
behaviour, especially if combined with gene-conversion or mitotic 
crossing over events that commonly occur in hybrids from 
outcrosses of P. sojae [42] and other species of Phytophthora [43]. 
Thus, it is possible that the phenotypic penetrance or inheritance 
of Avrlc in particular hybrid progeny may be influenced by 
epistatic loci, gene conversion, and mitotic crossing over. This is in 
addition to the epigenetic affects discussed earlier. 

Our past work on Avrla demonstrated that gain of virulence on 
Rpslii is caused by gene deletion or gene silencing [3]. In contrast 
to Avrlc, no serjuence polymorphisms within the Avrla open 
reading frame were detected among the P. sojae strains that carry 
copies of Avrla. It is also evident that Avrlc and Avrla are 
individually and collectively dispensable, as are most known Avr 
genes in P. sojae. 

The transient expression assays suggest that Rpslc can 
recognize Avrlc and Avrla. This conclusion will require further 
work to verify, such as by performing transformation of P. sojae to 
demonstrate that avirulence to Rpsl& and Rpslc can be acquired 
by ectopic expression of Avrla. Another example of an apparent 
dual specificity of a soybean Rps gene was reported recently, 
specifically that Avr lb and Avr Ik are both capable of triggering 
Rpslk mediated immunity [6]. However, the caveat is that these 



suggestions are based upon genetic isolines that have been 
developed for the various Rps genes. Soybean Rps genes remain 
poorly characterized at the molecular level but it is known that the 
Rpsl locus consists of a large cluster of nucleotide-binding, leucine- 
rich repeat t)pe of immune receptor genes [44,45]. Thus, there are 
alternative possibilities to account for the apparent dual specificity 
of Rpslli. and Rpslc, such as separate but closely linked iJ-genes 
that recognize each of the Avr factors individually. Indeed, the 
variety trial data presented in Table 2 suggests the existence of a 
soybean line that possess both RpsYk and Rpslc, an occurrence that 
would be unlikely if these two genes are truly allelic. 

A model for the structure and expression of the Avrla/ Ic locus 
in different strains of P. sojae is presented in Figure 10. We 
acknowledge that our results present uncertainties with regard to 
this model, but we feel that sufficient evidence supports our 
conclusion that Avrlc occurs in the Avrla gene cluster and 
corresponds to the RXLR effector Avh275c. These findings have 
practical importance in soybean breeding and disease manage- 
ment, especially since Rpslc is among the most widely deployed 
sources of P. sojae resistance presently in use. 

Materials and Methods 

Plant materials, Phytophthora sojae strains and virulence 
scoring 

Soybean {Glycine max) cultivar Williams (rps) and the corre- 
sponding isolines L75-3735 [Rpslc), L75-6141 [Rpsla) from the 
collection at Agriculture and Agri-Food Canada were used to 
evaluate the virulence of pathogen cultures and for transient 
expression by particle bombardment. The cultivar Harosoy (Rpsl) 
and the corresponding isoUnes Harol4 (Rpslc) Harosoy 63 (Rpsla) 
were also used to replicate the experiments in a separate genetic 
background. For virulence assays, 10 soybean seeds were sown in 
10 cm pots containing soil-less mix (Pro-Mix 'BX', Premier 
Horticulture Ltd, Riviere-du-Loup, Canada). A total of three pots 
per isolate were tested for each replicate, and a minimum of three 
independent biological replicates were performed for each P. sojae 
culture tested. Pots were watered with 3 mg/L 20-20-20 (N-P-K) 
fertilizer. Plants were grown in a controlled growth chamber with 
supplemental light (16 h photoperiod with 25°C day and 16°C 
night temperatures) for one week before inoculations. 

Phytophthora sojae isolates were from the collection at Agriculture 
and Agri-Food Canada, London, ON. The origin of each strain 
has been described [3,7]. Performing sexual crosses of P. sojae 
strains ACR10xP7076 and the generation of F] and F2 progeny 
has been described [5,33]. All of the isolates were routinely 
maintained on 2.5'% (v/v) vegetable (V8) juice medium at 25°C in 
the dark. The P. sojae cultures were transferred to 0.9% (v/v) V8 
agar plates for 5 to 7 days prior to green plant inoculations. 
Methods of hypocotyl inoculation and virulence scoring have been 
described [3]. 

Nucleic acid isolation, RT-PCR and DNA blot hybridization 

The purification of RNA from P. sojae mycelium and transcript 
analysis by reverse transcriptase PGR (RT-PCR) was carried out 
as previously described [46] . Total RNA was treated with DNAse 
I (Invitrogen). The RT-PCR was conducted on 1 (Jg RNA using a 
reverse transcriptase (SuperScriptlll, Invitrogen) system according 
to the manufacturer's instructions. Genomic DNA from P.sojae 
mycelium was isolated by phenol chloroform extraction and 
ethanol precipitation. Genomic PGR amplifications were per- 
formed using 15 ng DNA, 0.5 (iM primers, 0.25 mM dNTPs, Taq 
polymerase and supplied buffer. The following PGR program was 
used: 94°C for 2 min, 40 cycles of 94°G for 40 s, 58°G for 40 s. 
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Figure 10. A model for the structure and expression of tlie Avrla/lc locus in P. sojae strains. Results from the present study are 
summarized in this illustration of the Avr1a/)c locus. Bold arrows indicate predicted genes; Avh72 is shown with a dashed line because this is 
predicted to be a pseudogene due to a frame shift mutation. ?or Avric and Avrla, an asterisk (*) indicates gene-silencing. Disease outcome on fipsic 
and Rpsia soybean plants is shown: avirulent (A); or virulent (V). Gene silencing or gene deletion can account for gain of virulence in the P. sojae 
strains shown here, except for strain ACR9. It is not known how ACR9 evades Rpslc recognition, but an epistatic effector that suppresses the ^vr7c- 
Rpslc interaction could explain the result. 
doi:l 0.1 371/journal.pone.0089738.g01 0 



72°C for 30 to 60 s (varied with the PGR product size) and a final 
extension of 72°C for 10 min. The P. sojae actin gene was used as a 
control. 

Genomic DNA isolation from P. sojae mycelial cultures, 
digestion with the restriction enzyme Pmll, electrophoretic 
separation, blotting and hybridization was performed according 
to standard protocols [47]. For hybridization, a DIG-labeled 
probe corresponding to the fuU length Avrla open reading frame 
was prepared. This probe hybridizes to Avrla, Avric, and Avh72. 

Transient expression assays 

Soybean plants wcic grown in the growth chambers as 
described above. Leaves from 14 d old plants were harvested for 
co-bombardment. Primers used for cloning Avric segments into the 
transient expression vector pFF 1 9 are provided in Supplementary 
Table S2. The three different Avric open reading frames 
(Avh275c''^*^', Avh275c''™''* and Avh275c^'"'*) and the Avrla 
open reading frame were amplified using specific primers and 
cloned into the 35S promoter-derived plant expression vector 
pFF19 using 5amHI/PrfI restriction sites. The signal peptides were 
omitted from all constructs. Soybean leaves were transformed by 
co-bombardment with plasmids encoding a glucuronidase (GUS) 
reporter gene and a test construct, using tungsten beads. Co- 
bombardment assays were performed as previously described [48] . 
For the control, the test gene corresponded to a frame-shift mutant 
oi Avrla. Leaves were washed in 70% ethanol and photographed 
using a digital camera. Sample images were obtained and 
processed, and the volume of GUS-positive spots was calculated 
as the intensity (measured in absorbance units) multiplied by the 
area (mm^) [5]. 

Deep sequencing and SNP analysis 

The P. sojae parental strains ACRIO and P7076 and a total of 12 
Fj individuals from the ACRIO XP7076 cross were selected, 
including six representing each virulence phenotype towards 
soybean Rpslc plants. The F2 cultures were used for creating Pooll 
(avirulent) and Pool2 (virulent). Cultures were grown and DNA 



purified, as described above, for each of the 12 F2 cultures and two 
parents. Equal amounts of DNA were combined to create 
composite samples of Pooll and Pool2. Library construction and 
deep sequencing (lUumina HiSeq 2000) of ACRIO, P7076, Pooll 
and P00I2 was performed by The Centre for Applied Genomics 
(Sick Kids Hospital, Toronto, Canada). In excess of 150 million 
sequence reads were obtained for each sample. 

The sequence reads from the two pools were trimmed and 
aligned with the P. sojae reference stain P6497 V3.0 assembly 
downloaded from the DOE-JGI website. Specifically the un- 
masked assembly was accessed on January 16 2012 at: 

http:// genome.jgi.doe.gov/Physo3/ download/Physo3_Assembly 
Scaffolds.fasta.gz. 

Sequence reads were mapped using the Burrows Wheeler 
Alignment (BWA) version 0.6. 1 to generate a Sequence Alignment 
Map (SAM) [49,50]. The SAM-tools version 0.1.18 was used to 
call SNPs to a binary variant call format (BCF) file which was 
filtered for SNP quality using vcfutHs.pl from the Samtools 
package. AUele frequencies specific to the non-reference base for 
each SNP were calculated from the DP4 values in the BCF file. 

Amino acid alignment was performed using the ClustalW 
algorithm (http:/ /www.ebi.ac.uk/Tools/msa/clustalw2/). The 
SignalP 4.0 server was used for secretion signal peptide prediction 
(http://www.cbs.dtu.dk/services/SignalP/). 

Data deposition 

The Avric sequences described in this study were deposited to 
NCBI GenBank, accession numbers KF661323, KF661324, and 
KF661325. Genome re-sequencing data for P. sojae strains ACRIO 
and P7076 is available at NCBI Sequence Read Archive, 
BioProject accession PRJNA230486. 

Supporting Information 

Figure SI Alignment of DNA sequences of Avric and 
Avrla. 

(PDF) 
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Table SI A list of 24 candidate Avh genes tested for 
co-segregation with Avrlc in the P, sojae cross 
ACR10XP7076. 

(PDF) 

Table S2 Primer sequences. 

(PDF) 
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